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Abstract 


n five experiments, we examined the conditions under which participants remembered true and false 
information given as feedback. Participants answered general information questions, expressed their 
confidence in the correctness of their answers, and were given true or false feedback. In all five experiments, 
participants hypercorrected when they had made a mistake; that is, they remembered better the correct 
feedback to errors made with high compared to low confidence. However, in none of the experiments did 
participants hyper'correct' when false feedback followed an initially correct response. Telling people whether 
the feedback was right or wrong made little difference, suggesting that people already knew whether the 


feedback was true or false and differentially encoded the true feedback compared to the false feedback. An 


exception occurred when false feedback followed an error: participants hyper'corrected' to this false feedback, 
suggesting that when people are wrong initially, they are susceptible to further incorrect information. These 
results indicate that people have some kind of privileged access to whether their answers are right or wrong, 
above and beyond their confidence ratings, and that they behave differently when trying to remember new 


“corrective” information depending upon whether they, themselves, were right or wrong 


initially. The likely 


source of this additional information is knowledge about the truth of the feedback, which they rapidly process 


and use to modulate memory encoding. 


Significance 

This research addresses the issue of how people update 
both correct and incorrect semantic information, and 
whether their confidence in the truth of their initial re- 
sponses impacts that updating. When people are given 
corrective feedback after having committed a high-confi- 
dence error, they are more likely to remember that feed- 
back than when given feedback after having committed 
low-confidence errors. It is shown here that this hyper'cor- 
rection’ of high-confidence errors occurs regardless of 
whether the feedback is true or false. However, when they 
are given false feedback after having produced a high- ver- 
sus low-confidence correct response, such hypercorrec- 
tion does not occur. Two conclusions emerge. First, 
people have some knowledge of the truth of their answer 
above and beyond their stated confidence in its truth 
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when they produce the answer. It is likely that this add- 
itional knowledge stems from an evaluation of the truth of 
the feedback. Second, people are vulnerable to false infor- 
mation when they have made a high-confidence error. In 
this case, they appear to be unable to evaluate the truth of 
the feedback, and hence, hyper-encode the true and false 
information equivalently. 


Background 

While the concept of deliberately or intentionally con- 
veying false information dates back hundreds of years 
(e.g., Octavian’s and Mark Anthony’s propaganda war 
dating to ~33 BCE), what causes an individual to be sus- 
ceptible to false information or to believe true compared 
to false information has yet to be identified. Many stud- 
ies have shown that an individual’s confidence moderates 
their ability to update misinformation with true informa- 
tion. When an individual generates an incorrect answer 
to a question and is then provided with the correct 
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answer, they are considerably more likely to remember 
the correct answer when the initial error was made with 
high rather than low confidence (Butterfield & Mangels, 
2003; Butterfield & Metcalfe, 2001, 2006; Fazio & Marsh, 
2009, 2010; Huelser & Metcalfe, 2012; Kang et al., 2011; 
Kulhavy, Yekovich, & Dyer, 1976; Metcalfe, Butterfield, 
Habeck, & Stern, 2012; Metcalfe & Finn, 2012; Sitzman, 
Rhodes, & Tauber, 2014). This enhanced memory for 
corrective feedback to high-confidence errors is called 
the hypercorrection effect. It has been shown with chil- 
dren (Metcalfe & Finn, 2011), young adults (Butterfield 
& Metcalfe, 2006, and see Metcalfe, 2017, for review), 
and—to a lesser extent—older adults (Cyr & Anderson, 
2013; Eich, Stern, & Metcalfe, 2013; Metcalfe, Casal-Ros- 
cum, Radin, & Friedman, 2015; Sitzman, Rhodes, Tauber, 
& Liceralde, 2015). The hypercorrection effect occurs 
with both immediate and delayed testing (Butterfield & 
Mangels, 2003; Metcalfe & Miele, 2014), and it is found 
with general information questions as well as with concep- 
tual inferences, classroom materials, and lexical memory 
(Iwaki, Matsushima, & Kodaira, 2013; van Loon, Dunlosky, 
van Gog, van Merrienboer, & de Bruin, 2015). 

Despite converging evidence for the effect of confi- 
dence on later memory when an error was made ini- 
tially, in all previous experiments of the hypercorrection 
effect, the feedback given to participants has always been 
factually correct. Thus, the question remains of whether 
hyper'correction' would occur for false feedback given to 
an initially correct response. This question is relevant to 
many domains. For example, in the classroom, a student 
may hear another student’s incorrect response to a ques- 
tion. Likewise, “alternative facts” have become common- 
place in today’s parlance. Under what conditions are 
people likely to adopt misinformation and incorporate it 
into their belief system? 

Several theoretical positions concerning human mem- 
ory suggest that when people generate an answer that 
they assert with confidence to be true, they have no 
knowledge over and above those feelings of confidence 
of whether their answer is, in actuality, true or not. 
However, there is always the possibility that the ratings 
are made impulsively and could be made more accur- 
ately upon further deliberation (e.g., Buratti, Allwood, 
& Kleitman, 2013; Koriat, Lichtenstein, & Fischhoff, 
1980). However, aside from the possibility that there 
might be some latent knowledge that the person could 
theoretically tap into when making a judgment, stated 
confidence is taken to be a straightforward reflection of 
subjective knowledge. If true and false retrievals from 
memory are not distinguishable, and our evaluation of 
or confidence in the correctness of our answers is based 
on heuristics that should apply equally for right and 
wrong retrievals, then the correction of errors given 
true feedback may be indistinguishable from the 
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updating of correct answers by false feedback. If this is 
the case, then individuals may show the same confidence- 
related hyper'correction' to feedback when they had been 
correct just as they do when they were incorrect. On the 
other hand, while confidence may be highly associated 
with prior knowledge, other factors may could influence 
how a person responds to feedback and updates memory. 
For example, if they have some additional knowledge be- 
yond their stated confidence, or if underlying memory 
structures are different for correct vs. incorrect responses 
or for true or false feedback, then they might show a dif- 
ferent pattern of confidence-related memory updating. It 
is this basic idea that we test in the five experiments that 
follow. We investigate this question through the lens of 
the so-called hypercorrection effect, a phenomenon that is 
characteristic of the correction of high-confidence errors. 

In support of the conjecture that people have no way, 
over and above their confidence judgments, to distin- 
guish between whether they have generated the right an- 
swer or an error, Koriat’s (1993, 2008, 2011, 2012, 2018, 
and see Koriat & Goldsmith, 1996) accessibility/consen- 
suality model proposes that metacognitive confidence 
judgments—indicating people’s certainty about the cor- 
rectness of the answers they retrieve from memory—are 
based on the mnemonic cues of self-consistency, accessi- 
bility, familiarity, and fluency, even though they may 
hardly be aware of relying on these cues. Koriat (2008) 
notes: “It might be futile to expect that metacognitive 
judgments such as feeling of knowing or subjective con- 
fidence would have privileged access to the correct tar- 
get” (p. 954). Although Koriat acknowledges that high 
confidence is usually associated with correct responses, 
he argues that this typically found correlation does not 
occur because people have privileged access to the cor- 
rectness of their answers. Rather, it occurs because, gener- 
ally, people access correct rather than incorrect information 
from memory. However, he argues that when the wrong re- 
sponse is the consensual response, it, too, will be endorsed 
with high confidence. In that case, metacognitive judgments 
will be counter-diagnostic of the correctness of the answer 
(see, Koriat, 1993, 2018). 

Similarly, Lewandowsky, Ecker, Seifert, Schwarz, and 
Cook (2012) propose that many factors, including 
ideology and personal world views, influence people's 
overt evaluation of memorial information, as well as 
the likelihood that they will update misinformation 
when provided with correct information. They note 
that much of the information that people use to make 
assessments of the validity of information (Ecker, 
Lewandowsky, & Apai, 2011) is not necessarily diagnos- 
tic of correctness. Considering only the basic memory 
processes themselves, though, they suggest that there 
should be no fundamental cognitive difference between 
correcting errors when given the right answer and 
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overwriting correct information with wrong answers: 
“Correcting misinformation is cognitively indistinguish- 
able from misinforming people to replace their preexisting 
correct beliefs” (Lewandowsky et al., 2012, p. 124). Indeed, 
they argue that this purported indifference to the actual 
truth of responses may be responsible, in part, for why 
misinformation is so pernicious. 

Finally, when people produce an answer, even if they 
are unable to tell—over and above what is indicated by 
their confidence—whether that answer is correct or in- 
correct, they may, nevertheless, be able to engage add- 
itional cognitive processes when feedback is given. They 
may think that they are correct when they generate a 
high-confidence answer, but upon being given discrepant 
feedback realize—because of that additional informa- 
tion—that they had been wrong. Conversely, if they were 
correct but given false feedback, the feedback itself may 
provide additional information that solidifies the initially 
correct response—allowing them to resist the memory 
meddling that might accrue to false feedback. Indeed, 
Rich, Van Loon, Dunlosky, and Zaragoza (2017) recently 
found that belief in the feedback modulates the correc- 
tion of errors. Experiments 2 and 5 of the present series 
of experiments directly address the possibility that the 
feedback itself may be sufficient—at least in some 
cases—to allow people to ascertain whether their an- 
swers were in fact correct or not, and to update their 
memories differentially as a result of its truth value. 

The first question that we address, then, is whether 
people selectively remember feedback given to all high- 
compared to low-confidence responses, or whether, in- 
stead, they only hypercorrect when they were wrong ini- 
tially. The second question, which we _ investigate 
selectively following the commission of erroneous re- 
sponses, is whether the hypercorrection effect occurs re- 
gardless of whether the feedback given to errors is true 
or false. The third question we address relates to the me- 
morial role of the possible recognition of the truth value 
of the feedback. If people respond differently when they 
are wrong as opposed to when they are right, it is still pos- 
sible—as is consistent with the no-privileged-access 
view—that they did not know whether their answers were 
right or wrong, over and above their stated confidence, 
but they obtain further information from the feedback 
that allows them to modify their memory encoding. 

To investigate these issues, in a series of five experi- 
ments modeled on the basic hypercorrection paradigm 
(Butterfield & Metcalfe, 2001), we asked people to pro- 
duce answers to general information questions and to 
provide their confidence in their answers. Unlike other 
studies of the hypercorrection effect in which the feed- 
back was always the correct response, the feedback pro- 
vided following confidence ratings in the current set of 
experiments could be either true or false. In the first two 
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experiments, participants were provided with true feed- 
back when they made an error on the initial question, 
and false (but highly plausible) feedback when they were 
correct. In Experiments 3 and 4, for initially correct re- 
sponses, true feedback was provided half of the time, 
and the other half of the time participants were asked to 
remember false feedback. If people hypercorrect all high 
confidence answers--regardless of truth-value-- then all 
conditions should show the hypercorrection effect. For 
all experiments, then, memory for the feedback (whether 
true or false), as a function of people's confidence in the 
correctness of their original answers, as well as their 
knowledge of the truth-value of the feedback, was 
investigated. 


General method 

Design 

All five experiments were divided into three phases. In 
Phase 1, each participant was asked a series of ran- 
domly ordered general information questions. For each 
question, participants typed in their answer, and then 
indicated their confidence in its correctness on a slider 
scale ranging from “sure wrong” on the far left to “sure 
correct” on the far right, corresponding to values ran- 
ging from 0 to 100. Then, with the question, but not 
the participant’s answer, still on screen, feedback, con- 
sisting of a to-be-learned word in red, was presented 
for 2s. This sequence was repeated until all items had 
been presented. 

In Phase 2, each question was re-presented, in a random 
order. Participants were asked to type in the response to 
that question that had been presented in red during Phase 
1 (ie., the feedback). They were then asked to provide 
their confidence concerning whether they had correctly 
typed in the word that had been presented in red. 

Finally, in Phase 3, the participants were presented 
with all the questions again and were asked to provide 
the factually correct answer to each question, along with 
their confidence in the truth value of their answer. After 
the participants in Phase 3 provided what they believed 
was the correct answer, they were then shown the fact- 
ually correct answer along with the words: “The correct 
answer is: _..” This final, corrective feedback to all 
questions was included to ensure that participants left 
the experiments having been told, unambiguously, the 
correct answers to all questions. 

To evaluate whether participants hypercorrected in 
each of the experiments under different conditions, 
gamma (y) correlations (Goodman & Kruskal, 1954) 
were computed between the confidence on the initial 
responses, and whether, in Phase 2, the word that was 
given in red to be learned was recalled.” The y correl- 
ation between confidence in the original response and 
responding correctly in the final, Phase 3 test (when 
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the participant was asked to give the correct answer, 
and not necessarily the answer that was given in red as 
feedback) was also computed for all experiments. The 
criterion for hypercorrection was a y correlation signifi- 
cantly greater than zero. Note that “correct” in Phases 
1 and 3 reflects factual accuracy, whereas “correct” in 
Phase 2 reflects recall of the feedback word printed in 
red, regardless of its factual accuracy. 


Materials 

The general information questions (see Butterfield & 
Metcalfe, 2006) used in the current experiments 
included questions from Nelson and Narens (1980), vari- 
ous board games, and internet trivia sites. A latent se- 
mantic analysis (LSA; Landauer & Dumais, 1997; 
Landauer, Foltz, & Laham, 1998) of the relation of the 
question to the correct answer, and the question to the 
designated wrong answer, was conducted on the core set 
of 64 questions (given in Additional filr 1; additional 
questions were added for Experiment 4 only, to allow 
replication and to provide more data in each 
within-participant condition). The average LSA values® 
were well above zero (which would indicate no relation): 
M =.212, standard deviation (SD) = .209, ¢(63) = 8.10, p 
<.001, between the questions and the correct answers, 
and M = .248, SD = .14, t(62) = 13.56, p < .001, between the 
question and the incorrect answers. A paired t-test using 
question (instead of participant) in the analysis revealed 
no significant difference (M difference = -.03) between 
the LSA values of the correct answers’ and the incorrect 
answers’ relation to the question (¢(59) = 1.55, p =.125). 
As these analyses show, the incorrect answers that were 
designated a priori to be learned, in these experiments, 
were as closely associated with the questions as were 
the correct answers. The only discernible difference 
was in the truth value of the answers. 


Procedure 

Participants in all five experiments were tested individu- 
ally on iMac computers programmed in RealStudio (now 
called Xojo). They were instructed that they would be 
reading a list of general information questions and that 
after each question they should type in the correct answer. 
Participants were told not to make spelling mistakes. 
Spelling errors were manually checked and were rare. 
They were told that they would then be shown by the 
computer, in red, an answer that might be different from 
the answer that they had provided. Their job was to learn 
the answer in red for a later test in which they would be 
shown the question and would be asked to give the word 
that had been presented in red. All procedures used in this 
series of experiments conformed to Psychonomic Society 
ethical guidelines and were approved by the Columbia 
University Institutional Review Board. 
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Experiment 1 

The first experiment investigated recall of the feedback 
provided following high-confidence compared to low- 
confidence original responses under two different condi- 
tions: (1) when true feedback followed an initial error 
and (2) when false feedback followed an answer that was 
initially correct. The first situation has been extensively 
investigated and is the condition under which the stand- 
ard hypercorrection effect is found. The second situation 
has not previously been investigated, within the hyper- 
correction paradigm. The hypothesis—gleaned from the 
no-privileged-access view—was that hypercorrection of 
high-confidence responses would occur both when 
people had been wrong and were given the right an- 
swers, and when they had initially been right and were 
given wrong answers. 


Methods 

In Phase 1, participants were asked 64 randomly or- 
dered general information questions. If a participant’s 
answer was correct, they were given feedback in red 
that was incorrect but plausible (see Materials section). 
If the participant’s answer was wrong, the feedback in 
red was the correct answer. Participants were not told 
whether the to-be-learned item in red was true (i.e., 
factually correct) or false (i.e., factually incorrect). In 
Phase 2 of the experiment, the 64 questions were re- 
presented in a random order. Participants were asked 
to type in the response to a question that had been pre- 
sented in red and to provide their confidence concern- 
ing whether they had correctly typed in the word that 
had been presented in red. Finally, in Phase 3, the par- 
ticipants were presented with all the questions again 
and were asked to provide the factually correct answer 
to each question, along with their confidence in their 
answer. They were then shown the factually correct an- 
swer. Participants were 29 Columbia University stu- 
dents (8 male and 18 female, plus 3 who did not 
answer, with an average age of 21.43 years) who en- 
gaged in the experiment in exchange for a course 
credit. All except three participants also wrote short es- 
says for their class about the experiment. 


Results 
Phase 2 


Hypercorrection As can be seen on the left of Fig. 1, 
when participants had produced erroneous responses 
and were given the correct answer, a hypercorrection ef- 
fect was found such that the y correlation (M y =.29, 
SD =.29) between initial confidence and recall of the 
factually correct red response was significantly greater 
than zero (t(28) =5.26, p<.001). When, however, they 
had been correct initially and were given false feedback 
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Fig. 1 The y correlations between confidence in the factual accuracy of responses in the initial test and correct performance in Phase 2, in which 
recalling the red feedback word, regardless of whether it was true or false, was considered correct. The solid black bar represents the standard 
hypercorrection condition in which the participant answered incorrectly in Phase 1 and was given true feedback. The white bar provides the y 
correlations when the participant responded correctly in Phase 1 and false feedback was given. On the left are the results from Experiment 1, in 
which participants were not informed about whether the to-be-learned word, provided as feedback in red, was true or false. On the right are the 
results from Experiment 2, in which participants were informed, immediately upon being presented with the to-be-learned word in red, whether 


the feedback was true or false. Error bars indicate the standard error of the mean 


to recall, participants did not hyper'correct’ (M y =.11, 
SD = A7, t(26) = 1.22, p =.232). Although the former cor- 
relation differed from zero whereas the latter did not, 
the two y correlations were not significantly different 
from one another (t(26) = 1.60, p = .120). 


Recall The recall of the to-be-learned responses in red 
was significantly higher when the original answer had 
been correct (M =.68, SD =.20), compared to when the 
original answer had been incorrect (M=.58, SD=.19, 
(28) =2.31, p=.028), indicating that the incorrect 
answers given following correct responses were remem- 
bered better than were the correct responses given as 
feedback to errors. All recall means are presented in 
Table 1. 


Phase 3 
Relation of Phase 1 confidence to Phase 3 factual 
recall As can be seen on the left of Fig. 2, the y 


correlations between confidence in the original re- 
sponse and factually correct responding in the Phase 3 
test were significantly positive for answers that were 
correct initially but for which false feedback was given, 
indicating that participants reverted to their originally correct 
answers on the final test, especially when they had been highly 
confident originally (M y = .52, SD = .33, t(25) = 8.01, p < .001). 
However, even though they had been given the correct an- 
swers as feedback to their erroneous responses, participants 
in this experiment did not significantly hypercorrect in 
Phase 3. The mean y correlation was only .08 (SD = .27), 
which was not significantly different from zero (¢(28) = 
1.63, p =.115). The Phase 3 y correlations in the two con- 
ditions were significantly different from one another 
(t(25) = -5.38, p <.001). 


Recall As shown in Table 1, participants in Phase 3 gave 
the factually correct response marginally more fre- 
quently when they had been correct originally but had 


Table 1 Proportion of correct recall by phase depending on whether the given answer in Phase 1 was correct or incorrect and 


whether the feedback given to the response was true or false 


Phase Phase 1 Feedback Experiment 
response 1 5 3 4 5 
2 Incorrect True 58 (.19) 67 (.15) 65 (15 53 (15 56 (.24) 
False 62 (.16 54 (17 43 (.19) 
Correct False 68 (.20) 58 (.21) 70 (.30) 71 (21 56 (.24) 
True 99 (.03 96 (21 95 (.15) 
3 Incorrect True AB (.20) 66 (.16) 56 (14 42 (14) 63 (.17) 
False 02 (.03 03 (.03 55 (.20) 
Correct False 60 (.31) 85 (.24) 69 (31 87 (.22 99 (.05) 
True 99 (.03 95 (12 97 (.07) 
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Fig. 2 The y correlations between confidence in the factual accuracy of responses in the initial test and correct performance in Phase 3, in which 
participants were retested and were asked to provide the factually correct answer to each question. The solid black bar represents the standard 
hypercorrection condition in which an error had been committed in Phase 1 and true feedback was given. The white bar represents the condition in 
which the participant had originally been correct in Phase 1 and false feedback was given. On the left are results from Experiment 1, in which participants 
were not informed about whether the to-be-learned word, provided as feedback in red, was true or false. On the right are results from Experiment 2, in 
which participants were informed, immediately upon being presented with the to-be-learned word in red, whether the feedback was true or false. Error 
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been given false feedback (M = .60, SD = .31) than when 
they had been wrong originally but had been given cor- 
rect feedback (M=.48, SD=.20, ¢(28) =1.99, p=.057). 
Performance in Phase 3, where participants were asked 
to provide the factually correct answers to the questions, 
declined by 40% for items for which participants had 
been correctly initially, as a function of false feedback. 


Discussion 
The results of Experiment 1 indicate that participants 
hypercorrected on the immediate test when they were 
given correct feedback to their own errors. The results 
also reveal that participants did not hyper'correct' on the 
immediate test when they had made a correct response 
with high confidence but were then given false feedback. 
This finding weighs against the no-privileged-access view 
and is contrary to the hypothesis that people cannot dis- 
tinguish true from false information and will, therefore, 
hypercorrect solely as a function of their confidence. 
Furthermore, although a hypercorrection effect was 
found in the standard condition with the immediate test, 
this effect was not significantly different from zero on 
the retest after a slight delay. Several previous studies 
have shown that the standard hypercorrection effect per- 
sists after a delay of a week or more under conditions in 
which the factually correct answer is provided (Butter- 
field & Mangels, 2003; Metcalfe & Miele, 2014). It ap- 
pears that the uncertainty introduced by including some 
questions for which incorrect feedback followed correct 
answers may have undermined the stability of the hyper- 
correction effect. 


Experiment 2 

The results of Experiment 1 suggest that people may 
have some knowledge, other than their stated 
confidence ratings, about whether their answers were 
right or wrong that was likely responsible for the dif- 
ferential memory for high-confidence error feedback 
compared to high-confidence correct feedback. It 
seems likely that people quickly assessed the truth of 
the feedback and used this truth value to modulate 
their memory encoding of the to-be-remembered 
word in red. If there was such a modulation, it likely 
occurred because people knew, without anyone telling 
them, whether the feedback was true or false. Accord- 
ingly, Experiment 2 was identical to Experiment 1, ex- 
cept that after each to-be-learned red feedback word 
was presented in Phase 1, a message box told the par- 
ticipants, explicitly, whether the red word was the fact- 
ually correct answer or not. Participants were also told, 
at the outset of the experiment, that when they got an 
answer correct, the computer would present an incor- 
rect alternative for them to learn, and when they got 
the answer wrong, it would present the correct alterna- 
tive for them to learn. Participants were 29 Columbia 
University students (18 female and 10 male, plus 1 who 
did not answer, with an average age of 19.7 years). 


Results 
Phase 2 


Hypercorrection As can be seen on the right of Fig. 1, 
participants hypercorrected when they were given the 
correct answers following an error (M y =.27, SD =.23, 


Metcalfe and Eich Cognitive Research: Principles and Implications 


t(27) =6.19, p<.001). However, they showed no 
hyper'correction' when their original responses had been 
correct and they were asked to recall false feedback in 
Phase 2 (M y=.02, SD=.44, ¢(26) =.25, p=.800). The 
difference between these two conditions was significant 
(t(26) = 2.58, p = .016). 


Recall There was a marginal difference between recall 
of the to-be-learned responses in red when the original 
answer had been correct (and thus, false feedback was to 
be recalled; M=.58, SD=.21), compared to when the 
original answer had been incorrect (and thus, true feed- 
back was to be recalled; M = .67, SD = .15, t(27) = — 1.84, 
p =.078), such that the recollection of the true feedback 
following an error was higher than the recall of false 
feedback following an initially correct response. 


Phase 3 


Relation of Phase 1 confidence to Phase 3 factual 
recall As shown on the right of Fig. 2, the hypercor- 
rection effect, in the standard condition, persisted 
strongly in Phase 3 (M y=.25, SD=.19, ¢(27) = 6.93, 
p<.001). When participants had initially given the 
correct answer, were told that they were correct, but 
were then given false feedback to remember, they 
were more likely to correctly produce high-confidence 
than low-confidence (initially correct) responses in 
Phase 3 (M y =.40, SD = .49, ¢(11) = 2.79, p =.017). 


Recall Participants in Phase 3 gave the factually correct 
responses more frequently when they had been correct 
originally but given false feedback (M = .85, SD = .24) than 
when they had been wrong originally but had been given 
correct feedback (M = .66, SD = .16, t(27) = 6.30, p < .000). 


Comparison of Experiments 1 and 2 

Phase 2 

Hypercorrection 

To examine the effect of uncertainty about whether the 
feedback was true or false, the y correlations that are 
relevant to hyper'correcting’ from Experiment 1 were 
compared to those of Experiment 2. A 2 x 2 analysis of 
variance (ANOVA), treating experiment as a between- 
participant independent variable and true or false feed- 
back following their initial response as a within-partici- 
pant variable, showed no effect of experiment (F(1, 52) 
<1). There was a main effect of condition (F(1, 52) = 8.53, 
p =.005, N, = .14, 1 — B =.82), such that there was a greater 
hypercorrection effect to high-confidence responses follow- 
ing errors (M =.27, SD=.27) than following correct re- 
sponses (M=.07, SD=.45). There was no significant 
interaction between experiment and condition (F(1, 52) < 


(2019) 4:4 


Page 7 of 18 


1). Thus, the pattern of results across both experiments 
was equivalent. Participants’ hypercorrection performance 
for Phase 2 was the same whether they were given explicit 
experimenter-provided information about the correctness 
of the feedback or not. 


Recall 

Correct recall of the word in red in Phase 2 did not differ 
as a function of either condition (false feedback following 
a correct response vs. true feedback following an incorrect 
response) or experiment, both Fs(1, 55) < 1. There was an 
interaction between condition and experiment (F(1, 55) = 
8.53, p=.005, 7, = .13, 1 — B = .82), such that being cor- 
rect in Phase 1 and receiving false feedback resulted in 
marginally significantly better performance in Experiment 
1 (M=.68, SD =.20) than in Experiment 2 (M=.58, SD 
=.21, ¢(55) = 1.908, p=.06). On the other hand, being 
wrong in Phase 1 and receiving true feedback resulted in 
marginally significantly better performance on Phase 2 in 
Experiment 2 (M=.67, SD=.15) than in Experiment 1 
(M = 58, SD =.19, t(55) = 1.84, p = .07). 


Phase 3 

Relation of Phase 1 confidence to Phase 3 factual recall 
There was no overall effect of experiment (F< 1). There 
was a main effect of condition (F(1, 52) = 8.53, p = .005, 
yw, = .14, 1 — B=.82), such that there was a greater 
hypercorrection effect to high-confidence responses fol- 
lowing correct initial responses (M = .49, SD = .39) than 
following initial errors (M=.10, SD=.23). There was, 
however, no interaction between experiment and condi- 
tion (F(1, 36) = 2.04, p = .16). 


Recall 

There was an overall effect of experiment such that recall 
of the factually correct answer in Phase 3 was higher in 
Experiment 2 (M=.76, SD=.04) than in Experiment 
1 (M= 5A, SD = 04, A, 55) = 17.17, p< O01, 7, = 24,1 - B= 98). 
There was a main effect of condition such that recall 
was higher when participants had been correct ini- 
tially and given false feedback (M =.73, SD =.04) than 
when they had been wrong and provided with correct 
feedback (M=.57, SD=.02, F(1, 55) =20.43, p<.001, 
4, = -27, 1 — B=.99). There was no interaction be- 
tween experiment and condition (F(1, 55) <1). 


Discussion 

Together, the results of Experiments 1 and 2 suggest 
that increasing people’s certainty about the correctness 
of the feedback had little impact on hypercorrection 
and recall performance, as if participants knew whether 
they were right or wrong to begin with. There was no 
difference in the standard hypercorrection effect or in 
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the lack of a hyper'correction' effect when people were 
correct and given false feedback. There was, however, a 
difference in the persistence of the standard hypercor- 
rection effect after a delay. In Experiment 1, in which 
they were uncertain, the delayed hypercorrection effect 
was smaller than in Experiment 2, in which there was 
no uncertainty. Further, people’s factually correct per- 
formance on the final test was different depending on 
whether the participants were explicitly told that the 
false feedback had been incorrect. 


Experiment 3 

In the first two experiments, the truth or falsity of the 
feedback was perfectly negatively correlated with 
whether the participant’s initial answer had been right or 
wrong. Given that the data indicate that people 
responded differently to questions on which they had 
been correct and incorrect, this correlation may have 
been problematic. It is possible, for instance, that partici- 
pants might have noticed it and used this information to 
alter their learning strategies. Accordingly, in the last 
three experiments, the truth or falsity of the feedback 
was randomly assigned with respect to the correctness 
of participants’ initial responses. 


Method 

In Experiment 3 there were four conditions: (1) incorrect 
followed by true feedback, (2) incorrect followed by false 
feedback, (3) correct followed by false feedback, and (4) 
correct followed by true feedback. As in the first two ex- 
periments, participants were retested in Phase 3 for the 
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factually correct answers, made a confidence rating about 
their response, and then were told the factually correct an- 
swer for each question. Participants were 24 Columbia 
University students (12 female and 12 male, with an aver- 
age age of 20.25 years). The participants received a bonus 
course credit for participating in the experiment. 


Results 
Phase 2 


Hypercorrection* As is shown in the left of Fig. 3, a 
hypercorrection effect was observed in the standard condi- 
tion in which participants made an error and then received 
true feedback (M y=.26, SD=.44, ¢(23) = 2.91, p =.008). 
There was also a hyper'correction' effect when participants 
had made an initial error and false feedback was given to 
them (M y =.40, SD =.38, £(23) = 5.04, p< .001). To assess 
whether this effect stemmed from participants receiving as 
false feedback the same, erroneous answer that they had 
given as their response in Phase 1,” a second hyper'correc- 
tion’ y was computed in which all trials in which the ori- 
ginal error was the same as the to-be-learned (false) 
feedback in Phase 2 were removed. The resulting y correl- 
ation was still significantly positive (M y = .41, t(23) = 4.40, 
p<.001). In the condition in which the initial response was 
correct and false feedback was given, the y correlation be- 
tween original confidence and production of the response 
in red was not significantly different from zero (M =.09, 
SD = 80, ¢(15) = .49, p =.630), indicating that participants 
in this condition did not hyper'correct’. The difference 
among the three conditions was not significant in this 
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Fig. 3 The y correlations between confidence in the factual accuracy of responses on the initial, Phase 1 test and correct performance in Phase 2, in which 
recalling the feedback word, regardless of whether it was true or false, was considered correct. The solid black bar represents the standard hypercorrection 
condition in which the participant answered incorrectly in Phase 1 and was given true feedback. The hatched bar represents the condition in which the 
participant answered incorrectly in Phase 1 and was given false feedback. The white bar indicates the condition in which the participant answered 
correctly in Phase 1 and was given false feedback. When people were correct in Phase 1 and then were given correct feedback, they were, of course, 
Virtually always correct, making it impossible to compute gammas in this condition. So, this condition is not included in this graph. In the left and center 
are the results from Experiments 3 and 4, respectively, in which participants were not informed about whether the to-be-learned word, provided as 
feedback, was true or false. On the right are the results from Experiment 5 in which participants were informed, irmmediately upon being presented with 
the to-be-learned word in red, whether that feedback was true or false, and, if false, what the true answer was. Error bars indicate the standard error of the 
mean 
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experiment (F(2, 30) < 1, 1, = .05, 1 — B =.16), possibly be- 
cause many participants did not provide observations in all 
three conditions, and the power was low. 


Recall As indicated in Table 1, when participants were 
correct in Phase 1 and were given true feedback, they 
were virtually always correct in Phase 2 (M = .99, SD = .03). 
A repeated measures ANOVA was conducted on the 
remaining three conditions (correct in Phase 1 followed by 
false feedback, incorrect in Phase 1 followed by true feed- 
back, and incorrect in Phase 1 followed by false feedback). 
The means, respectively, were: M =.70, SD =.30; M =.65, 
SD = .15; and M = .62, SD=.16. The main effect of condi- 
tion was not significant (F(2, 46) = 1.65, p = .204, , 1 = 07, 
1 — B =.33). 


Phase 3 


Relation of Phase 1 confidence to Phase 3 factual 
recall The y correlations, shown in the left panel of Fig. 4, 
were computed for Phase 3 of the experiment, in which 
participants were asked to provide the factually correct 
answer to each question. When participants had provided 
the correct answer in the first place and were given true 
feedback, they were nearly always correct and thus, there 
was not enough variability to compute y. When they had 
been correct in the first place and were given false feed- 
back, the y correlations were high, in keeping with their 
original confidence in the correctness of their responses 
(M y=.72, SD=.63, t(18) =4.98, p<.001). This result 
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Fig. 4 The y correlations between confidence in the factual accuracy of responses on the initial, Phase 1 test and correct performance in 
Phase 3, in which participants were retested and were asked to provide the factually correct answer to each question. The solid black bar 
represents the standard hypercorrection condition in which the participant answered incorrectly in Phase 1 and was given true feedback. The 
hatched bar represents the condition in which the participant answered incorrectly in Phase 1 and was given false feedback. The white bar 
indicates the condition in which the participant answered correctly in Phase 1 and was given false feedback. In the left and center are the 
results from Experiments 3 and 4, respectively, in which participants were not informed about whether the to-be-learned word, provided as 
feedback, was true or false. On the right are the results from Experiment 5 in which participants were informed, immediately upon being 
presented with the to-be-learned word in red, whether that feedback was true or false, and, if false, what the true answer was. Error bars 
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indicates that they knew that the false feedback was false 
and stuck to their initial answers as being correct. When 
an error had been committed in Phase 1 and true infor- 
mation was presented as feedback to be learned, the y cor- 
relations were not significantly different from zero in 
Phase 3 (M=.10, SD=.45, t(23) = 1.04, p=.31). Thus, if 
participants were uncertain about the correctness of the 
response given as feedback, the delayed hypercorrection 
effect was not different from zero, a result which echoes 
that found in Experiment 1. If participants were wrong 
initially and false feedback was given, there were very 
few factually correct responses in Phase 3, and only 8 
participants had sufficient data for a y correlation to be 
computed. It is not surprising, then, that the mean y 
was not different from zero (M y =.22, SD=.85, t(7) 
=.740, p =.483). As with the Phase 2 y correlations, 
there were many missing data points, and so the differ- 
ences among the y correlations were not significant 
(F(2,36) = 2.49, p=.10, m7, = .122, 1 — B = 468). 


Recall As indicated in Table 1, when participants were 
correct in Phase 1 and were given true feedback, they 
remained correct on the final, Phase 3 test with a high 
probability (M=.99, SD=.03). A repeated measures 
ANOVA was conducted on the remaining three condi- 
tions. The main effect of condition was significant (F(2, 
46) = 89.53, p<.001, 7, = .79, 1 — B=1). Following an 
error in Phase 1, correct factual recall was significantly 
worse when participants had been given false feedback 
(M = .02, SD = .03) versus true feedback (M = .56, SD = .14, 
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(23) = 19.57, p< .001). When participants had been cor- 
rect in Phase 1 and were given false feedback, their prob- 
ability of producing the correct response in Phase 3 was 
M =.69, SD =.31. They exhibited better performance in 
Phase 3 when they had initially been correct and were 
given false feedback than when they had initially been 
wrong and received false feedback (¢(23) = 10.89, p < .001). 
Correct responding in the latter condition was not 
different from zero. In addition, being correct initially, 
even when the correct response was followed by false 
information, resulted in better accuracy on the final 
test than receiving true feedback following an error 
(t(23) = 2.02, p=.055). Finally, recall was higher when 
an initially correct Phase 1 response was followed by true 
feedback as opposed to false feedback (#(23) = 4.94, p < .001). 


Discussion 

The hypercorrection results from the first two experi- 
ments were further clarified and bolstered by Experi- 
ment 3. Participants hypercorrected when they had 
made an error and were given corrective feedback, but 
did not hyper'correct' when they had been correct and 
were given false feedback. When they had been right, 
it seems plausible that they may have had some know- 
ledge that the feedback they were given was wrong 
(when it was). Interestingly, though, they also 
hyper'corrected' when they had made an error in 
Phase 1 and they were given false feedback following 
their error. Participants did not appear to be able to 
recognize misinformation and exclude it from being 
encoded when they had made an error. 


Experiment 4 

A limitation of Experiment 3 was that it was sometimes 
not possible to compute y correlations, because it is ne- 
cessary to have at least one correct and one incorrect re- 
sponse and several different confidence levels in every 
condition. For this reason, Experiment 4 was a replica- 
tion with a slight variation. We doubled the number of 
general information questions (and thus, experimental 
trials), asking participants 128 questions rather than 64, 
and increased the number of participants recruited. This 
experiment frequently took participants several hours to 
complete. Participants were 40 Columbia University stu- 
dents (30 female and 8 male, plus 2 who did not answer, 
with an average age of 20.92 years). The participants re- 
ceived a bonus course credit for participating. 


Results 
Phase 2 


Hypercorrection The Phase 2 y correlations are shown in 
the center of Fig. 3. In the standard condition, a hypercor- 
rection effect was evident (M y = .32, SD =.24, £(39) = 8.36, 
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p<.001). There was also a hyper'correction' effect when the 
original answer was incorrect and false feedback was given, 
both when trials on which participants had received false 
feedback in Phase 2 and the same erroneous answer that 
they had given in Phase 1 were included (M y=.12, SD 
= .27, (39) = 2.82, p=.008), and when those trials (which 
occurred with a M probability of .029) were removed (M y 
= .21, SD=.25, t(39) = 5.30, p< .001). When the initial re- 
sponse was correct and false feedback was given, the y cor- 
relation between original confidence and production of the 
response in red was significantly negative (M =-.17, SD 
= 49, t(35) = — 2.06, p= .047), indicating a 
confidence-related resistance to remembering the item in 
red. The y correlations in the three conditions were signifi- 
cantly different from one another (F(2,70) = 19.76, p< .001, 
UF = .36, 1 — B = 1), with the condition in which the original 
answers were correct followed by false feedback showing 
lower y correlations than when the error was followed by 
correct feedback (¢(35) = - 2.91, p =.006) or by erroneous 
feedback (¢(35) = - 2.83, p = .008). These two latter condi- 
tions both showed y correlations that were greater than 
zero and did not differ from one another (¢ < 1). 


Recall When participants were correct in Phase 1 and 
were given true feedback, they were virtually always cor- 
rect in Phase 2 (M =.96, SD=.21). A repeated measures 
ANOVA on the proportion of items recalled in Phase 2 
for the three remaining conditions revealed a significant 
main effect (F(2, 78) = 28.98, p< .001, ", = 43,1 -B=1). 
When participants had been correct originally and were 
given false feedback, recall of the items in red was .71, 
SD = .21. When participants had been incorrect origin- 
ally, recall of the items in red was .53, SD=15 when 
they were given true feedback and .54, SD=.17 when 
they were given false feedback (tf <1). The condition in 
which the original answers were correct followed by 
false feedback differed from both of these conditions 
(t(39) =5.82, p<.001, and 1¢(39)=5.70, p<.001, 
respectively). 


Phase 3 


Relation of Phase 1 confidence to Phase 3 factual 
recall As shown in the center of Fig. 4, when an error 
had been made in Phase 1 and true feedback was given, 
a hypercorrection effect emerged in Phase 3 (M y=.10, 
SD = .22, ¢(39) = 2.85, p =.007). When an error had been 
made in Phase 1 and false feedback was given, the y was 
not significantly different from zero in Phase 3 (M y =.05, 
SD =.25, (39) =1.16, p>.25). Finally, when participants 
had provided the correct answer in the first place, were 
given false feedback, and were then retested for the fact- 
ually correct answer on the final test, the y correlations 
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were positive and high (M y=.72, SD =.63). Participants 
recalled the correct answers consistently with their original 
confidence in the correctness of their Phase 1 responses 
(t(20) = 4.91, p<.001). The main effect of condition was 
significant (F(2, 40) = 11.42, p < .001, 1, = .36,1- 6 =.99). 


Recall For Phase 3 accuracy, when participants had 
been correct originally and were given true feedback, 
they remained correct, as expected (M = .95, SD =.12). 
When participants had initially been correct and re- 
ceived false feedback, their final factually correct per- 
formance was .87, SD=.22, unlike when they were 
wrong and received true feedback (M=.42, SD=.14, 
(39) = 12.10, p<.001). When they had been incorrect 
originally and were given false feedback, they were 
nearly always wrong, not surprisingly (M = .03, SD = .03). 
However, when participants had been wrong originally 
but received true feedback, there was a sizable im- 
provement in accuracy on the final test (¢(39) = 17.89, 
p<.001). Receiving false, relative to true feedback, fol- 
lowing a correct response in Phase 1 resulted in a 
loss of accurate performance on the final test (from 
.95 to .87, (39) = 2.78, p=.008). The false feedback— 
without any validation of the veracity of that feed- 
back— hurt participants knowledge of what was in 
fact true. 


Discussion 

As had been found in Experiment 3, in this experiment, 
participants hypercorrected when they had made an 
error and were given corrective feedback. They also 
hyper'corrected’ when given false information when 
they had been wrong initially. However, they did not 
hyper'correct’ when they had been correct and were 
given false feedback. 


Experiment 5 

In the final experiment, participants were told directly 
which answers were correct and which were incor- 
rect, and, if incorrect, what the correct answer was. 
Otherwise, Experiment 5 was like Experiments 3 and 
4. Participants were 25 Columbia University students 
(7 male and 17 female, plus 1 who did not answer, 
with an average age of 19.21 years). A course credit 
was offered in exchange for participating in the ex- 
periment. None of the participants had taken part in 
any of the other experiments in this series. Data from 
one participant were eliminated because of their low 
accuracy in Phase 1, which prohibited most analyses 
from being computed (only 3 out of 64 answers cor- 
rect, or <5%). 
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Results 
Phase 2 


Hypercorrection As shown in the right of Fig. 3, in 
Phase 2, the mean y correlation for the standard con- 
dition was significantly greater than zero (M y =.39, 
SD = .36, t(23) =5.294, p<.001). The y correlations for 
false feedback following an incorrect response also showed 
hypercorrection (M y = .19, SD = .38, ¢(23) = 2.457, p = .02), 
although the magnitude of the hypercorrection in these 
two conditions differed. There was a larger hypercorrection 
effect when true as opposed to when false feedback was 
given (t(23) = 2.20, p =.038). When the questions had ori- 
ginally been answered correctly and false feedback was pro- 
vided, the y correlations were not significantly different 
from zero (M = -.01, SD =.55, t(20) < 1). 


Recall When participants were correct in Phase 1 and 
were given true feedback, they were virtually always cor- 
rect in Phase 2 (M =.95, SD=.15). A repeated measures 
ANOVA on the proportion of items recalled in Phase 2 
for the three remaining conditions revealed a significant 
main effect of condition (F(2, 46) = 7.74, p = .001, ce = .25, 
1 — 6 =.94). Participants did recall the false feedback, even 
when they had initially been correct in Phase 1 (M =.56, 
SD = .24), although recall of this false information following 
a correct response initially was lower than recall of the true 
information following a correct response (t¢(23) = 6.90, 
p<.001). There was no difference in recall of false 
information following a correct answer versus true 
feedback following an incorrect response (¢(23) = 1.00, 
p <.328). However, when participants had been wrong 
in Phase 1, they remembered the factually true and false 
feedback differentially in Phase 2 (¢(23) = 2.79, p = .010). 
The true information was recalled significantly more 
often than the false information. 


Phase 3 


Relation of Phase 1 confidence to Phase 3 factual 
recall As shown in the right of Fig. 4, in Phase 3, where 
the criterion for correctness was being factually correct, 
the two y correlations for the conditions in which par- 
ticipants had originally committed an error both 
showed hypercorrection. Presumably, even though 
people were tasked with remembering the item in red 
in the second phase, which could be factually incorrect, 
the presentation of the true answer, in this experi- 
ment—regardless of whether they also had to remem- 
ber a false answer or not—led to hypercorrection in 
this third phase. The y correlations were significantly 
positive for wrong answers followed by false feedback 
(M = .43, SD =.33, (23) = 6.47, p<.001) and for wrong 
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answers followed by correct feedback (M = .35, SD =.36, 
t(24) = 4.73, p< .001). The two conditions were not differ- 
ent from one another (f(23) <1). When participants had 
been correct initially and were given false feedback 
(and were told that it was false), the Phase 3 y was posi- 
tive, .37, SD=.71, suggesting that participants were 
more likely to be correct on the final test when they 
had had high compared to low confidence in their ori- 
ginal responses (£(9) = 1.63, p = .138). 


Recall The most prominent result that emerged as a 
function of being told which answers were right and 
wrong was during Phase 3, when people were asked to 
recall the correct answers. If they had been correct in 
Phase 1, they almost never made a mistake in Phase 3, 
despite having been exposed to and remembering the 
false feedback (M=.99, SD=.05). Performance in this 
condition was better than when participants had ini- 
tially been incorrect and received true feedback (M 
= .63, SD=.17, (23) = 10.52, p<.001). When they had 
made a mistake in Phase 1, they also recalled the cor- 
rect answers better when they had been asked to re- 
member true compared to false feedback (M =.55, 
SD =.19, t(23)=2.29, p=.032). Likewise, participants 
showed better Phase 3 performance when they re- 
ceived false feedback following initially correct versus 
incorrect Phase 1 responses (¢(23) = 12.03, p<.001). 
Strikingly, there was no difference in final recall per- 
formance when participants had been correct origin- 
ally as a result of whether the to-be-remembered 
feedback they received was true or false (f(23) = 1.23, 
p =.233). Thus, the decrement in final correct per- 
formance seen in previous experiments appears to be 
attributable to participants having been uncertain 
about the truth of their original responses in the face 
of false information. When given false information 
but informed of its truth value, the false information 
had no effect on final factually correct recall. 


Comparison of Experiment 5 with Experiments 3 
and 4 

To analyze whether having explicit knowledge about the 
truth of the feedback impacted y correlations and the pro- 
portion of words correctly recalled in Phases 2 and 3, 
ANOVAs treating experiment (3 and 4 vs. 5) as a 
between-participant factor and condition (false feedback 
following initial correct responses, false feedback following 
initial incorrect responses, and true feedback following ini- 
tial errors) as a within-participant factor were conducted. 


Phase 2 

Hypercorrection 

The main effect of experiment, and the interaction 
between condition and experiment, were not significant 
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(F< 1). However, the main effect of condition was signifi- 
cant (F(2, 142) = 10.91, p< .001, 1 = .13, 1 — B= .99). Be- 
ing incorrect in Phase 1 and receiving false feedback (M 
=.21, SD=.35) resulted in a lower average y correlation 
than being incorrect in Phase 1 and receiving true feed- 
back (M = .32, SD = .34, t(87) = -2.20, p = .03), but a higher 
y correlation than being correct originally but receiving 
false feedback (M=-.06, SD =.59, t(72) = 3.51, p= .001). 
Wrong answers followed by true feedback (M =.32, SD 
= .35) resulted in higher y correlations than did correct 
answers followed by false feedback (¢(73) = 4.39, p < .001). 


Recall 

There was a main effect of experiment such that the 
proportion of correct Phase 2 responses was higher in 
the experiments in which participants were not informed 
(3 and 4) than in the experiment in which they were in- 
formed (Experiment 5; F(1, 86) = 5.94, p = .017, Ne = 17, 
1 — B=1). There was also a main effect of condition 
(F(2, 172) = 16.97, p<.001, 72 = .12, 1 — B=.92). Being 
correct in Phase 1 and receiving false feedback (M = .66, 
SD =.25) resulted in better recall of the red word in 
Phase 2 than being incorrect on Phase 1 and receiving ei- 
ther true feedback (M = .56, SD = .18, t(87) = 4.43, p < .001) 
or false feedback (M =.53, SD = .18, (87) =5.81, p< .001). 
Being wrong in Phase 1 and receiving true feedback also re- 
sulted in significantly better Phase 2 recall of the red word 
compared to being wrong and receiving false feedback 
(t(87) = 2.25, p = .027). These main effects were moderated 
by a significant interaction between condition and experi- 
ment (F(2, 172) = 4.98, p = .008, 7, = .06, 1 — B=.81). The 
proportion of correct recall for the red word when false 
feedback followed a correct response was greater for Exper- 
iments 3 and 4 (M =.71, SD=.25) than for Experiment 5 
(M =.56, SD =.25, t(86) = 2.47, p=.015). Correct recall 
of the red word when false feedback followed an in- 
correct response was greater for Experiments 3 and 
4 (M=.57, SD=.17) than for Experiment 5 (M =.43, 
SD=.19, ¢(86) = 3.29, p=.001), but not when true 
feedback followed an incorrect response (M =.57, 
SD = .16 for Experiments 3 and 4 vs. M=.56, SD = .24 
for Experiment 5, t(86) <1). 


Phase 3 

Relation of Phase 1 confidence to Phase 3 factual recall 
The main effect on y correlations of experiment was not 
significant (F(1, 36) = 2.01, p =.165). The main effect of 
condition was marginally significant (F(2, 72) = 3.01, 
p =.056, y; = .08, 1 — B= .57). Whereas the differ- 
ence between the y correlations associated with be- 
ing incorrect in Phase 1 and receiving false feedback 
(M=.19, SD=.41) and being incorrect in Phase 1 
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and receiving true feedback (M=.18, SD=.35) was 
not significant (¢(71) <1), being correct and receiving 
false feedback (M =.57, SD=.55) resulted in higher y 
correlations than both being incorrect and receiving 
false feedback (£(37) = 2.99, p =.005) and being incor- 
rect and receiving true feedback (M=.14, SD=.35, 
t(49) = 4.20, p<.001). There was also a significant 
interaction between condition and experiment (F(2, 
72) = 3.53, p = .034, 7, = .09, 1 — B= .64). The y cor- 
relations for initially incorrect responses followed by true 
feedback in the uninformed experiments was .35, SD = .36, 
and .10, SD =.33, for the informed experiment, which was 
significant (t(70) = 3.85, p< .001). For initially incorrect re- 
sponses followed by false feedback, the y correlations were 
M = 43, SD =.33, and M =.07, SD = 40, respectively, which 
was also significant (¢(86) = 3.09, p =.003). In both cases, 
the y correlations were significantly higher in Experiments 
3 and 4 (uninformed) than they were in Experiment 5 (in- 
formed). There was no difference for initially correct re- 
sponses followed by false feedback (¢(48) = -1.19, p = .24). 


Recall 

There was an overall effect of experiment such that re- 
call of the factually correct answer in Phase 3 was higher 
in Experiment 5 than it was in Experiments 3 and 4 
(FL, 86) = 120.25, p<.001, #7, = 58, 1 — B=1). There 
was also a main effect of condition (F(2, 172) = 235.28, 
p<.001, ie = .73, 1 — B = 1). Higher accuracy on Phase 3 
was seen for correct Phase 1 responses followed by false 
feedback (M =.85, SD =.25) relative to incorrect Phase 1 
responses followed by both true feedback (M=.51, SD 
=.17, (87) = 11.498, p<.001) and false feedback (M = .17, 
SD = .26, t(87) = 22.02, p< .001). An error followed by true 
feedback also produced better performance than an error 
followed by false feedback (¢(87) = 14.16, p< .001). Finally, 
there was an interaction between condition and experiment 
(F(2, 172) = 27.07, p< .001, " = .24, 1 -B= 1), such that 
the proportion of factually correct responses in Phase 3 was 
higher in every condition in Experiment 5 than in Experi- 
ments 3 and 4: initially correct responses followed by false 
feedback (M = .99, SD =.05 and M = .81, SD =.27, respect- 
ively, (86) = 3.25, p = 001); initially wrong answers followed 
by true feedback (M =.63, SD=.17 and M = 47, SD=.15, 
respectively, ¢(86) = 4.08, p <.001); and initially wrong an- 
swers followed by false feedback (M = .55, SD =.20 and M 
= .03, SD =.03, respectively, t(86) = 20.65, p < .001). 


General discussion 

First, these data show that it is not the case that the 
correction of misinformation is indistinguishable from 
misinforming people at the expense of previously held 
correct beliefs. In all five experiments, participants 
hypercorrected when they had made an error and were 
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given correct feedback to remember. In none of the five 
experiments did participants hyper'correct' when they 
had originally been correct and were given false feed- 
back to remember. These results are the first to show 
empirically that individuals respond differentially, re- 
gardless of confidence, as a function of whether they 
were initially correct or incorrect. They also suggest 
that people may be able to use second-order knowledge 
concerning their own metacognitive confidence judg- 
ments (Buratti et al., 2013; Dunlosky, Serra, Matvey, & 
Rawson, 2005). 

Second, although these data support the idea that 
people have some knowledge that differentiates correct 
and incorrect responses—and results in differential pro- 
cessing of correct responses overwriting errors than for 
misinformation overwriting correct answers, despite be- 
ing equally confident in the correctness of these correct 
and incorrect responses initially—the pattern of the dif- 
ferences was only slightly more exaggerated and clear 
when people were provided with explicit knowledge of 
whether they were initially correct or incorrect, and 
whether the feedback they received in response to their 
answers was true or false. In the experiments in which 
participants were given this explicit knowledge (Experi- 
ments 2 and 5), compared to those in which such know- 
ledge was not provided (Experiments 1, 3, and 4), 
similar trends emerged. 

Third, when participants had made errors initially, not 
only did they hypercorrect when given correct answers, 
but they also hyper'corrected' when they were given false 
feedback. The fact that participants did not distinguish 
true and false feedback following an error is remarkable. 
When people were wrong with high confidence, they 
were particularly vulnerable to further misinformation. 


The locus of differences in memory for true or false 
feedback 

Evaluation of the feedback 

An obvious source of the differential processing of 
feedback following correct answers and errors may be 
that participants are able to evaluate the truth of the 
feedback that they were given, and they use this evalu- 
ation, along with the additional information that the 
feedback brings to mind, to bias updating in memory. 
That telling participants whether the feedback was true 
or false made almost no difference to their processing 
suggests that they may have already made this evalu- 
ation on their own. 

A rapid online feature-matching model illustrates this 
idea, in which memory is updated as a function of the 
participants’ initial response—correct or incorrect—and 
the feedback given—true or false (Fig. 5). Here, bars in- 
dicate the accuracy of relevant potential features about 
answers to the question “What is the largest state that’s 
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east of the Mississippi River?” that a participant may use 
to update memory. In Fig. 5(a), the participant incor- 
rectly responds “Minnesota,” a large state that borders 
the Mississippi River. However, possible disconfirming 
information is not considered. The feedback “Georgia” 
makes this participant realize that Minnesota is too far 
west. Crucially, this discriminating information (see 
Gigerenzer, Hoffrage, & Kleinbdlting, 1991; Koriat, 2012) 
is triggered by the feedback, but was not considered dur- 
ing the initial evaluation of their answer to the question. 
Realizing that they were wrong, they hypercorrect this 
true feedback. 

In Fig. 5(b), the participant is confident that Georgia is 
one of the biggest states east of the Mississippi, and also 
they have coded that Georgia is east of the Mississippi. 
When they receive the false feedback “Texas,” they know 
immediately that this cannot be a correct response be- 
cause Texas is west of Louisiana, which borders the Mis- 
sissippi to the east, and so they do not hyper'correct' to 
the false feedback. 

The last case, illustrated in Fig. 5(c), is potentially the 
most interesting, from the perspective of understanding 
why misinformation is adopted. Here, the participant re- 
sponds “Minnesota” and receives the false feedback 
“Texas.” If this person was unsure of where the Missis- 
sippi River is geographically (which in this hypothetical 
example is clear, as their initial response was Minnesota, 
a state that is west of the Mississippi), but does know 
that Texas is the largest state in the continental U.S., 
they may hyper'correct'. They realize that their high-con- 
fidence response may be wrong and they get no informa- 
tion from the feedback that is disconfirming, and so they 
are willing to accept the feedback regardless of whether 
it is true or false and shift their belief towards this false 
feedback. This interpretation is supported by the fact 
that in all three of the experiments (3—5) in which false 
feedback was provided to incorrect responses, partici- 
pants showed a hyper'correction’ effect. 

The speed of this postulated evaluative process and 
the ensuing enhanced processing (for a correct answer 
following an error) or the inhibition of the enhanced 
processing (for erroneous feedback following a correct 
response) is striking. Event-related potential (ERP) evi- 
dence indicates that an exaggerated P3a component re- 
lated to the feedback to high-confidence errors occurs 
roughly 400 ms post-feedback (Metcalfe et al., 2015). 
Given this, it is likely that people have some additional 
knowledge—above and beyond their stated confidence— 
that they bring to bear even before the feedback is 
presented. 


Pre-feedback knowledge 
There is considerable evidence that people may have ac- 
cess to other knowledge that is related to their 
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confidence judgments. A number of studies have indi- 
cated that a particularly dense semantic landscape is re- 
lated to highly confident erroneous responses (Eich et 
al., 2013; Metcalfe & Finn, 2012, 2013; Metcalfe & Miele, 
2014; Sitzman & Rhodes, 2010; Sitzman et al., 2015) and 
that the density of knowledge may facilitate further 
learning. Butterfield and Mangels (2003) and Butterfield 
and Metcalfe (2006) provide evidence that people are 
more familiar with high- than low-confidence ques- 
tions, as well as with the responses, both correct and 
incorrect, associated with them—a result which has 
been borne out by LSA of the relation of the errors to the 
correct answers (Eich et al., 2012; Metcalfe & Finn, 2012; 
Metcalfe & Miele, 2014). Metcalfe and Finn (2012, 2013) 
showed that once people are provided with the correct an- 
swer following a highly confident compared to a 
low-confidence error, they frequently say they “knew it all 
along.” Thus, even without feedback, participants are 
more likely to produce the correct answer on a second 
guess, to choose it on a multiple-choice test, and to guess 
it when given clues. Given that confidence is related to 
the density of the semantic landscape, and that the 
density of the semantic landscape facilitates learning, it 
is possible that the density of the semantic landscape is 
responsible for the hypercorrection effect. This view is 
bolstered by the fact that hyper'correction', in the 
present experiments, was observed when people were 
wrong with high confidence and incorrect answers were 
provided to them to learn. However, presumably the 
density of the semantic neighborhood increases with 
confidence in correct (Tunney, 2010) as well as incor- 
rect answers. Thus, this underlying knowledge density 
factor should facilitate both the updating of correct as 
well as incorrect responses. However, in the present 
studies, a hyper'correction' effect was not observed for 
correct answers, suggesting that some other factor is 
involved. 

It is, of course, possible that participants were disin- 
genuous about their initial confidence ratings, or that 
there was no equivalency between ratings made for in- 
correct versus correct responses at the time that the rat- 
ing was made. In other words, a participant who rates 
their confidence as 90% and gives the correct response 
might not mean the same thing as when they give a 90% 
confidence rating and gives an incorrect response. While 
this may seem, at first, unlikely, given that confidence 
ratings were made before feedback was given, this possi- 
bility is not entirely implausible. 

In other paradigms, people sometimes appear to have 
knowledge that is not captured by simple metacognitive 
ratings. For example, when people are in a tip-of-the-- 
tongue state they typically express very high 
feeling-of-knowing ratings (see, Schwartz & Brown, 
2014). However, they also sometimes know—and can 
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report if asked—when they are experiencing a blocker 
that they are certain is wrong (see, Kornell & Metcalfe, 
2006). Similarly, Lindsay and Johnson (1989) showed that 
although people will confidently affirm that they saw a mis- 
leading event in one context, if they are pressed more dir- 
ectly about the context of the event, they can reveal that 
they know that the experience did not occur in the queried 
context after all. They based their original judgment 
upon confirmatory fluency information, but can further 
scrutinize their memory for disconfirming information 
that alters the simpler assessment. Son (2010) observed 
that even when participants indicated that they were 
100% confident that they had fully learned a particular 
item, when asked a subsequent question of whether 
they wanted to restudy the item, they would often opt 
to do so. Presumably, an item that was fully learned 
with absolute certainty would not need to be restudied. 
Thus, in this case, the metacognitive measure indicated 
that they believed they knew the answer, but their sub- 
sequent choices indicated that they knew they did not. 

Furthermore, several studies have shown that asking a 
metacognitive question in different ways can produce dif- 
ferent results. Koriat and Bjork (2005) showed that when 
people were asked to give their confidence ratings about 
how likely it was that they would remember an item, their 
subjective expected probability of recall was greater than 
when the experimenters reframed the question to enquire 
about how likely it was that they would forget the item. 
Presumably—while technically asking for the same infor- 
mation—the question about remembering evoked con- 
firmatory evidence while the forgetting question provoked 
disconfirming evidence, producing different results. Finn 
(2015) showed that this difference in judgments had con- 
sequences. It played out in study choices, wherein people 
chose to study more in the latter than in the former case. 
Perhaps, if subjects in the present study had been asked 
how confident they were that their responses had been 
false rather than true, the results would have been differ- 
ent. People may have more knowledge about the truth of 
their answers, even prior to receiving feedback, than a sin- 
gle simple query about confidence taps. 

A number of studies have asked people to make re- 
sponses and confidence judgments multiple times to the 
same question (Koriat, 2012). These studies follow from 
the finding that if several people with different view- 
points are asked a question, the answer that the group 
as a whole produces is more closely aligned with the 
truth than are the answers from individual participants 
(Galton, 1907; Wallsten, Budescu, Erev, & Diederich, 
1997; Yaniv & Milyavsky, 2007), a phenomenon called 
the “wisdom of crowds” (Hertwig, 2012). Similarly, if a 
single individual is tasked with retrieving and making 
corresponding confidence judgments several times, ra- 
ther than just once, the answer obtained by combining 
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the different retrieval events can be more accurate than 
any of the single retrieval events, as Herzog and Hertwig 
(2014) have demonstrated. Fraundorf and Benjamin 
(2014) estimated that the advantage gained from the 
inner crowd is about 1/10 that of using different people, 
and the effects can sometimes be small (Ariely et al., 
2000). Even so, the inner crowd findings demonstrate 
that people may have untapped knowledge about the ac- 
curacy of their retrievals. Additional research is needed 
to determine whether such additional knowledge might 
be responsible for the confidence-related differences we 
observed in the experiments presented here. 

Finally, perhaps people’s confidence judgments were 
systematically inaccurate. Perhaps it really is the case 
that 90% confidence in one context (for example, after 
making an error) is not the same as 90% confidence in a 
different context (for example, after a making a correct 
response). It is possible that people sometimes impul- 
sively and incorrectly express high confidence, and they 
are later able to spot when they did so. This possibility 
could be checked by asking subjects to make metacogni- 
tive judgments about the accuracy of their confidence 
ratings. We were able to find only four studies in which 
people were asked to make such second-order metacog- 
nitive judgments—judgments about their original meta- 
cognitive judgments (Buratti et al., 2013; Buratti & 
Allwood, 2012a, b; Dunlosky et al., 2005). One of these, 
though—the study by Buratti et al. (2013)—is directly 
relevant to the present results. In this study, people an- 
swered semantic memory questions and gave their confi- 
dence about their answers. Later, participants were 
asked if they wanted to change any of their confidence 
ratings. Very few ratings were changed. Indeed, so few 
were altered that the overall accuracy of the relation be- 
tween confidence and accuracy was unaffected. Even so, 
when the authors investigated which particular ratings 
were changed, the results were systematic. The confi- 
dence ratings associated with errors that had been 
assigned high-confidence ratings and those associated 
with correct answers that had been assigned 
low-confidence ratings were chosen. The changes people 
made to these ratings were in the direction of greater 
metacognitive accuracy. These findings favor the idea 
that people do have some access to the truth of their re- 
sponses above and beyond what their initial confidence 
ratings indicate. 

It would appear that all of these lines of research bol- 
ster the idea that people have some knowledge beyond 
what their initial metacognitive ratings indicate. It is not- 
able, though, that in all the above cases, people had to 
be explicitly asked to make the finer-grained judgment 
or to reassess what they had said before. Only then did 
the additional knowledge surface. In our experiments, 
people were never asked to make such a reassessment. If 
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this additional knowledge is responsible for the differ- 
ences between the patterns of updating following re- 
sponses that were correct and those that were incorrect, 
it emerged spontaneously in the experiments described 
here. Whether this second-order metacognitive know- 
ledge is sufficient to account for our findings, then, re- 
mains an empirical question. 

In the present experiments, people answered factual 
questions and were asked, in a straightforward manner, 
to provide confidence judgments indicating how sure 
they were that their answers were true. The data pre- 
sented here provide evidence that people either have ac- 
cess to or gain access to knowledge beyond their original 
confidence judgments about the correctness of their 
own retrieval. The present experiments cannot defini- 
tively answer whether this knowledge was gained 
through evaluation of the feedback that was given in re- 
sponse to the initial answers, or whether it was available 
before the feedback was provided. When this additional 
information about the truth of their answers emerges, 
and how it may be put to use, are questions for further 
empirical research. However, confidence has different 
memorial consequences, depending on whether the an- 
swer was true or false and whether the feedback was 
true or false. It appears, from the research presented 
here, that meta-metaknowledge of the truth of an indi- 
vidual’s own answers—knowledge that is not expressed 
in first-order confidence judgments—differentially im- 
pacts cognitive processing and memory. 


Endnotes 

‘The term hyper'correction' will be used to refer to en- 
hanced memory for to-be-learned new responses as a 
function of high compared to low confidence when the 
new to-be-learned responses are factually incorrect. 

*y correlations between confidence and recall could 
sometimes not be computed (if there were no responses 
in a particular condition, if all responses in a condition 
were correct or all were incorrect, or if confidence 
values did not vary, for instance), which accounts for the 
differences from the expected degrees of freedom in 
some of the results. 

Several answers were not in the LSA database, which 
accounts for the differences in the degrees of freedom. 

“It was not possible to compute y correlations for the 
condition in which participants originally generated the 
correct answer and then were provided with the correct 
answer, because —not surprisingly—participants nearly 
always got these answers correct. 

°For example, if participants answered “Orwell” to the 
question: “What is the last name of the author who 
wrote Brave New World?” they may have also received 
the incorrect but plausible lure “Orwell” as false 
feedback. 
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